Noname manuscript No. 

(will be inserted by the editor) 



Jan Chomicki 



O 
O 

(N 



in 



Database Querying under Changing 
Preferences 



Received: date / Accepted: date 



Abstract We present here a formal foundation for an iterative and incremental 
approach to constructing and evaluating preference queries. Our main focus is 
on query modification: a query transformation approach which works by revising 
the preference relation in the query. We provide a detailed analysis of the cases 
where the order-theoretic properties of the preference relation are preserved by the 
revision. We consider a number of different revision operators: union, prioritized 
and Pareto composition. We also formulate algebraic laws that enable incremental 
evaluation of preference queries. Finally, we consider two variations of the basic 
framework: finite restrictions of preference relations and weak-order extensions 
of strict partial order preference relations. 

Keywords preference queries • preference revision • query evaluation • strict 
partial orders • weak orders 

o 

Q , 1 Introduction 

The notion of preference is common in various contexts involving decision or 
choice. Classical utility theory (Fis70) views prefer ences as binary relations . This 
view has recently been adopted in database research ( C ho02l : ICho03l : lKie02tlKk 02). 
where preference relations are used in formulating preference queries. In AI, 
various ap proaches to compact specification of preferences have been explored 
(BB D + 04l) . The semantics underlying such approaches typically relies on prefer- 
ence relations between worlds. 
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Preferences ca n be embedded into da t abase q uery languages in several dif- 
ferent ways. First, dCho02t Icho03t lKie02l lKK02ll propose to introduce a special 
operator "find all the most preferred tupl es according to a given preference rela- 
tion." This operator is called winnow in JCho02l : ICh o03). A special cas e of win- 
now is c alled skyline (BKS01) and has been recently extensively studied (PTFS03; 
BGZ04). Second, ( AWOfl UHPOA assume that preference relations are defined us- 
ing numeric utility functions and queries return tuples ordered by the values of 
a supplied utility function. It is well-known that numeric utility functions cannot 
represent all strict partial o rders (|Fis7fi) . not even those that occur in database ap- 
plications in a natural way (Cho03D- For example, utility functions cannot capture 
skylines. Also, ordered relations go beyond the classical relational model of data. 
The evaluation and optimization of queries over such relations requires significant 
changes to relational query processors and optimizers (ISWGA04). On the other 
hand, winnow can be seamlessly combined with any relational operators. 

We adopt here t he first approach, based on winnow, within the preference 
query framework of (Cho03) (a similar model was described in (Kie02)). In this 
framework, preference relations between tuples are defined by first-order logical 
formulas. 

Example 1 Consider the relation Car(Make,Year) and the following preference 
relation >-c, between Car tuples: 

within each make, prefer a more recent car, 

which can be defined as follows: 

(m,y) )~C\ i m \y') = m = in' Ay > y'. 

The winnow operator (Oq 1 returns for every make the most recent car available. 
Consider the instance r\ of Car in Figure QJ. The set of tuples (Oc x ls shown 
in Figure^). 
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Fig. 1 (a) The Car relation; (b) Winnow result 



In this paper, we focus on preference queries of the form a>y (R), consisting of 
a single occurrence of winnow. Here y is a preference relation (typically defined 
by a formula), and R is a database relation. The relation R represents the space of 
possible choices. We also briefly discuss how our results can be applied to more 
general preference queries. 
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Past work on preference queries has made the assumption that preferences are 
static. However, this assumption is often not satisfied. User preferences change, 
sometimes as a direct consequence of evaluating a preference query. Therefore, 
we view preference querying as a dynamic, iterative process. The user submits a 
query and inspects the result. The result may be satisfactory, in which case the 
querying process terminates. Or, the result may be too large or too small, contain 
unexpected answers, or fail to contain expected answers. By inspecting the query 
answer, the user may realize some previously unnoticed aspects of her preferences. 
It is also possible that not all the relevant data was included in the database over 
which the preference query is evaluated. 

So if the user is not satisfied with the preference query result, she has several 
further options: 

Modify and resubmit the query. This is appropriate if the user decides to refine 
or change her preferences. For example, the user may have started with a partial or 
vague concept of her preferences (PFT03). We consider here query modification 
consisting of revising the preference relation >-, although, of course, more general 
transformations may also be envisioned. 

Update the database. This is appropriate if the user discovers that there are 
more (or fewer) possible choices than originally envisioned. For example, in com- 
parison shopping the user may have discovered a new source of relevant data. 

In this context we pursue the following research challenges: 

Defining a repertoire of suitable preference relation revisions. In this work, 
we consider revisions obtained by composing the original preference relation with 
a new preference relation, and transitively closing the result (to guarantee tran- 
sitivity). We study different composition operators: union, and prioritized and 
Pareto composition. Those operators represent several basic ways of combining 
preferences and have already been incorporated into preference query languages 
dCho03l : lKie02h . The operators reflect different user attitudes towards preference 
conflicts. (A conflict is, intuitively, a situation in which two preference relations 
order the same pair of tuples differently.) Union ignores conflicts (and thus such 
conflicts need to be prevented if we want to obtain a preference relation which 
is a strict partial order). Prioritized composition resolves preference conflicts by 
consistently giving priority to one of the preference relations. Pareto composition 
resolves conflicts in a symmetric way. We emphasize that revision is done using 
composition because we want the revised preference relation to be uniquely de- 
fined in the same first-order language as the original preference relation. Clearly, 
the revision repertoire that we study in this paper does not exhaust all meaningful 
scenarios. One can also imagine approaches wh ere axio matic properties of pref- 
erence revisions are studied, as in belief revision 

Identifying essential properties of preference revisions. We claim that revisions 
should preserve the order-theoretic properties of the original preference relations. 
For example, if we start with a preference relation which is a strict partial order, the 
revised relation should also have those properties. This motivates, among others, 
transitively closing preference relations to guarantee transitivity. Preserving order- 
theoretic properties of preference relations is particularly important in view of the 
iterative construction of preference queries where the output of a revision can 
serve as the input to another one. We study both necessary and sufficient condi- 
tions on the original and revising preference relations that yield the preservation of 
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their order-theoretic properties. Necessary conditions are connected with the ab- 
sence of preference conflicts. However, such conditions are typically not sufficient 
and stronger assumptions about the preference relations need to be made. Some- 
what surprisingly, a special class of strict partial orders, interval orders, plays an 
important role in this context. The conditional preservation results we establish 
in this paper supplement those in (Cho03; Kie02) and may be used in other con- 
texts where preference relations are composed, for example in the implementation 
of preference query languages. Another desirable property of revisions is mini- 
mality in some well-defined sense. We define minimality in terms of symmetric 
difference of preference relations but there are clearly other possibilities. 

Incremental evaluation of preference queries. At each point of the interaction 
with the user, the results of evaluating previous versions of the given preference 
query are available. Therefore, they can be used to make the evaluation of the 
current query more efficient. For both the preference revision and database up- 
date scenarios, we formulate algebraic laws that validate new query evaluation 
plans that use materialized results of past query evaluations. The laws use order- 
theoretic properties of preference relations in an essential way. 

Example 2 Consider Example ^ Seeing the result of the query <BCi( r l)i a user 
may realize that the preference relation >-q is not quite what she had in mind. 
The result of the query may contain some unexpected or unwanted tuples, for 
example ti,. Thus the preference relation needs to be modified, for example by 
revising it with the following preference relation >~c 2 ■ 

(m,y) y Cl («',/) e m = "VW" A fflV "VW" Ay = /. 

As there are no conflicts between and >~c 2 ' me user chooses union as the 
composition operator. However, to guarantee transitivity of the resulting prefer- 
ence relation, >-C] U ^c 2 nas t0 be transitively closed. So the revised relation is 
~^c*= TC(yc\ U >~c 2 )- (The explicit definition of >-c* ls given in Example^)]) The 
tuple f3 is now dominated by t% (i.e., tz >~c* ti) and will not be returned to the user. 

The plan of the paper is as follows. In Section|2| we define the basic notions. In 
Section|3] we introduce preference revision. In Section|4] we discuss query modifi- 
cation and the preservation by revisions of order-theoretic properties of preference 
relations. In Section|5] we discuss incremental evaluation of preference queries in 
the context of query modification and database updates. Subsequently, we consider 
two variations of our basic framework: (finite) restrictions of preference relations 
(Section |6|l and weak-order extensions of strict partial order preference relations 
(Section^). We briefly discuss related work in Section|S]and conclude in Section 

HI 

2 Basic notions 

We are working in the context of the relational model of data. Relation schemas 
consist of finite sets of attributes. For concreteness, we consider two infinite, 
countable domains: 3 (uninterpreted constants, for readability shown as strings) 
and £2 (rational numbers), but our results, except where explicitly indicated, hold 
also for finite domains. We assume that database instances are finite sets of tuples. 
Additionally, we have the standard built-in predicates. 
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2.1 Preference relations 



We adopt here the framework of |Cho03). 



Definition 1 Given a relation schema R(A\ ■■ -A^) such that [/,-, 1 < i < k, is the 
domain (either S> or J3) of the attribute A,-, a relation y is a preference relation 
over R if it is a subset of (t/i x • • • x £4) X (U\ X • • • X £4). 

Although we assume that database instances are finite, in the presence of infi- 
nite domains preference relations can be infinite. 

Typical properties of a preference relation >- include (Fis70): 

- irreflexivity: Vx. x )f- x; 

- transitivity: Vx,y,z- (x y y Ay y z) =4> xy z; 

- negative transitivity: \fx,y,z- (x >f- y Ay )f- z) => x )f z; 

- connectivity: \/x,y. x y y Wy y x Vx = y; 

- strict partial order (SPO) if >- is irreflexive and transitive; 



- weak order (WO) if y is a negatively transitive SPO; 

- total order if >- is a connected SPO. 

Every total order is a WO; every WO is an 10. 

Definition 2 A preference formula (pf) C(?i,?2) is a first-order formula defining a 
preference relation yc in the standard sense, namely 



An intrinsic preference formula ( ipf) is a preference formula that uses only built-in 
predicates. 

By using the notation yc for a preference relation, we assume that there is an 
underlying pf C. Occasionally, we will limit our attention to ipfs consisting of the 
following two kinds of atomic formulas (assuming we have two kinds of variables: 
^-variables and i2-variables): 

- equality constraints: x = y, x ^ y, x = c, or x ^ c, where x and y are S>- 
variables, and c is an uninterpreted constant; 

- rational-order constraints: xXy or xXc, where A G {=, 7^, <,>,<, >}, x and 
y are ^-variables, and c is a rational number. 

An ipf all of whose atomic formulas are equality (resp. rational-order) constraints 
will be called an equality (resp. rational-order) ipf. If both equality and rational- 
order constraints are allowed in a formula, the formula will be c alled ERO. Clearly, 
ipfs are a special case of general constraints (KLPOO; KKR95), and define fixed, 
although possibly infinite, relations. 

Proposition 1 Satisfiability of quantifier-free ERO formulas is in NR 




\/x,y,z,w. (x y y Az y w) => (x y wVz y y); 



h y c t 2 mc(h,h). 
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Proof Satisfiability of conjunctions of atomic ERO constraints can be checked in 
linear time ( GSW96). In an arbitrary quantifier-free ERO formula negation can be 
eliminated. Then in every disjunction one needs to nondeterministically select one 
disjunct, ultimately obtaining a conjunction of atomic constraints. □ 

Proposition Q] implies that all the properties that can be polynomially reduced 
to validity of ERO formulas, for example all the order-theoretic properties listed 
above, can be decided in co-NR 

Every preference relation y generates an indifference relation ~: two tuples 
fi and ti are indifferent (t\ ~ t%) if neither is preferred to the other one, i.e., f i )f t 2 
and ?2 )f- 1\ . We denote by ~ c the indifference relation generated by >-£. 

Composite preference relations are defined from simpler ones using logical 
connectives. We focus on the following basic ways of composing preference rela- 
tions over the same schema: 

- union: t\ (V i U ^2) h iff ?i >-i t 2 V?i >-2 t 2 ; 

- prioritized composition: t\ (y\ > y 2 ) t% iff t\ y\ t2 V (f2 )f\ t\ Afi y 2 h)\ 

- Pareto composition: 

t\ (>-l®^2)^2 iff (h >\t 2 ht 2 ^2?l)V(fi y 2 t 2 At 2 )f\ fl). 

We will use the above composition operators to construct revisions of given pref- 
erence relations. We also consider transitive closure: 

Definition 3 The transitive closure of a preference relation >- over a relation 
schema R is a preference relation TCfV) over R defined as: 

(fi ,f 2 ) G TC(y) iff fi y" t 2 for some n > 0, 

where: 

fi y x t 2 = t x yt 2 

h >-' !+1 f 2 = 3f 3 .fi ^f 3 Af 3 y" t 2 . 

Clearly, in general Definition [3] leads to infinite formulas. However, in the 
cases that we consider in this paper the preference relation yrcly) w iU m f act be 
defined by a finite formula. 

Proposition 2 Transitive closure of every preference relation defined by an ERO 
ipfis definable using an ERO ipfof at most exponential size, which can be com- 
puted in exponential time. 

Proof This is because transitive closure can be expressed in Datalog and the eval- 
uation of Datalog programs over equality an d rational -order constraints terminates 
in exponential time (combined complexity) (KKR95). □ 

In the cases mentioned above, the transitive closure of a given preference rela- 
tion is a relation definable in the signature of the preference formula. But clearly 
transitive closure, unlike union and prioritized or Pareto composition, is itself not 
a first-order definable operator. 
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2.2 Winnow 

We define now an algebraic operator that picks from a given relation the set of the 
most preferred tuples, according to a given preference relation. 

Definition 4 ( ChoQ^t) If R is a relation schema and >- a preference relation over 
R, then the winnow operator is written as G) y (R), and for every instance r of R: 

(Dy(r) ={t er\ -Bt' er.t'y t}. 

If a preference relation is defined using a pf C, we write simply (Oc instead of (0y c . 
A preference query is a relational algebra query containing at least one occurrence 
of the winnow operator. 



3 Preference revisions 

The basic setting is as follows: We have an original preference relation y and re- 
vise it with a revising preference relation yo to obtain a revised preference relation 
>-'. We also call y' a revision of >-. We assume that y, y^, and y' are preference 
relations over the same schema, and that all of them satisfy at least the properties 
of SPOs. 

In our setting, a revision is obtained by composing y with >-q using union, 
prioritized or Pareto composition, and transitively closing the result (if necessary 
to obtain transitivity). However, we formulate some properties, like minimality or 
compatibility, in more general terms. 

To define minimality, we order revisions using the symmetric difference (A). 

Definition 5 Assume y i and y 2 are two revisions of a preference relation y with 
a preference relation >-o. We say that >-j is closer than y% to y if y\Ay C 

To further describe the behavior of revisions, we define several kinds of pref- 
erence conflicts. The intuition here is to characterize those conflicts that, when 
eliminated by prioritized or Pareto composition, reappear if the resulting prefer- 
ence relation is closed by transitivity. 

Definition 6 A 0-conflict between a preference relation y and a preference re- 
lation ^0 is a pair (ti,t 2 ) such that t\ >-o ti and t% y t\. A l-conflict between y 
and ^0 is a pair (t\ , t 2 ) such that t\ y^ ti and there exist s\ , . . .s\, k > 1, such that 
t 2 y s\ y ■ ■ ■ y s k y t\ and t\ s k ^ • • • ^0 s\ ^0 h- A 2-conflict between y and 
yo is a pair (h,t2) such that there exist s\, . . -,Sk,k > 1 and wi,... ,w m , m>l, such 

thatt 2 ysi > — y s k yt\, t\ ^ s k ^ ••• ^o*t ^oh, h y§w\ >-o ••• yow m yt 2 , 

and t2 )/- w m ■■ ■ ^ W\ )f- 1\ 

A l-conflict is a 0-conflict if >- is an SPO, but not necessarily vice versa. A 2- 
conflict is a l-conflict if >-o is an SPO. The different kinds of conflicts are pictured 
in Figures |3 and |3] denotes the complement of >-). 
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(a) (b) 
Fig. 2 (a) O-conflict; (b) 1 -conflict 




Fig. 3 2-conflict 



Example 3 If ^o= and >-= {{b,a)}, then (a,£>) is a O-conflict which is 

not a 1-conflict. If we add (b,c) and (c,a) to >-, then the conflict becomes a 1- 
conflict (s\ = c). If we further add (c,b) or (a,c) to >-o, then the conflict is not a 
1-conflict anymore. On the other hand, if we add (a,d) and (d,b) to >-n instead, 
then we obtain a 2-conflict. 

We assume here that the preference relations >- and >~o are SPOs. If >-'= 
TC(y U >~o)> then for every O-conflict between >- and )^o, we still obviously have 
t\ >-' ?2 and t2 >-' t\. Therefore, we say that the union does not resolve any con- 
flicts. On the other hand, if >-'= TC(>~o>>-), then for each O-conflict (fi,f2)> 
fi >-o > >- H and — >(f2 >~o > >- h)- In the case of 1-conflicts, we get again t\ >-' t2 
and ?2 >-' ?i. But in the case where a O-conflict is not a 1-conflict, we get only 
t\ >-' ?2. Thus we say that prioritized composition resolves those O-conflicts that 
are not 1-conflicts. Finally, if >-'= rC(>- <E> >-o), then for each 1-conflict (h,t2), 
->(ti >~ <g> >~o h) and -ifo >- ® fi). We get fi >-' f 2 and f 2 >-' t\ if the conflict 
is a 2-conflict, but if it is not, we obtain only t 2 >-' fi- Thus we say that Pareto 
composition resolves those 1-conflicts that are not 2-conflicts. (Pareto composi- 
tion resolves also conflicts that are symmetric versions of 1-conflicts, with and 
>- interchanged, which are not 2-conflicts.) 
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We now characterize those combinations of y and that avoid different 
kinds of conflicts. 

Definition 7 A preference relation >- is i-compatible(i = 0,1,2) with a preference 
relation if there are no /-conflicts between y and ^o- 

0- and 2-compatibility are symmetric. 1 -compatibility is not necessarily symmet- 
ric. For SPOs, O-compatibility implies 1 -compatibility and 1 -compatibility im- 
plies 2-compatibility. Examples \l\ and |3 show a pair of 0-compatible relations. 
O-compatibility of y and >-o does not require the acyclicity of y U yo or that one 
of the following hold: >- C y Q Qy,o\: yny = 0. 

Propositions^and|2]imply that all the variants of compatibility defined above 
are decidable for ERO ipfs. For example, 1 -compatibility is expressed by the con- 
dition y^ 1 nrC(> — >-g ) = where y^ 1 is the inverse of the preference rela- 
tion >-o- 

O-compatibility of y and ls a necessary condition for TC(y U >-o) to be 
irreflexive, and thus an SPO. Similar considerations apply to TC(yo D> y) and 1- 
compatibility, and TC(y <E)yo) and 2-compatibility. In the next section, we will 
see that those conditions are not sufficient, further restrictions on the preference 
relations will be introduced. 

We conclude by noting the relationships between the three notions of prefer- 
ence composition introduced above. 

Lemma 1 For every preference relations y and yo 
and if yo and y are 0-compatible 

yo ® y = yo > y = yo u y. 



4 Query modification 

In this section, we study preference query modification . A given preference 
query O)y(R) is transformed to the query (O y i(R) where y' is obtained by com- 
posing the original preference relation >- with the revising preference relation >-rj, 
and transitively closing the result. (The last step is clearly unnecessary if the ob- 
tained preference relation is already transitive.) We want y' to satisfy the same 
order- theoretic properties as >- and >-o, and to be minimally different from >-. To 
achieve those goals, we impose additional conditions on y and ^o- 

For every 9 G {U, > , ®}, we consider the order-theoretic properties of the pref- 
erence relation y' = yo 9 y, or y' = TC(yo 9 y) if yo 9 y is not guaranteed 
to be transitive. To ensure that this preference relation is an SPO, only irreflexivity 
has to be guaranteed; for weak orders one has also to establish negative transitivity. 

1 The term query modification was used in early relational systems like INGRES to denote 
a technique that produced a changed version of a query submitted by a user. The changes were 
meant to incorporate view definitions, integrity constraints and security specifications. We feel 
that it is justified to use the same term in the context of composition of a preference relation in 
a query with some other preference relation, to produce a new query. 
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4.1 Strict partial orders 

SPOs have several important properties from the user's point of view, and thus 
the ir prese rvation is desirable. For instance, all the pref erence relations defined 
in l|Kie02) and in the language Preference SQL (KK02) are SPOs. Moreover, if 
>- is an SPO, then the winnow ft)>_(r) is nonempty if (a finite) r is nonempty. The 
fundament al algori thms for computing winnow require that the preference relation 
beanSPO(Cho03). Also, in that case incremental evaluation of preference queries 
becomes possible (Proposition [5] and TheoremQl. 

Theorem 1 For every 0-compatible preference relations y and yo such that one 
is an interval order (10) and the other an SPO, the preference relation TC(>~o y), 
where 9 G {U, >, ®}, is an SPO. If the IO is a WO, then TC{y 9 y) =y 9 y. 

Proof By Lemma ^ O-compatibility implies that >-q U y = y<z > y = yo >-• 
Thus, WLOG we consider only union. Assume is an IO. If TC(y U >~o) is not 
irrefiexive, then >- U has a cycle. Consider such cycle of minimum length. It 
consists of edges that are alternately labeled >-q (only) and y (only). (Otherwise 
the cycle can be shortened). If there is more than one non-consecutive ^o-edge 
in the cycle, then being an IO implies that the cycle can be shortened. So the 
cycle consists of two edges: t\ >-q t% and t% yt\. But this is a 0-confiict violating 
O-compatibility of y and yo. □ 

It is easy to see that there is no preference relation which is an SPO, contains 
>- U yo, and is closer (in the sense of Definition|5j to >- than TC(y U yo). 

As can be seen from the above proof, the fact that one of the preference re- 
lations is an interval order makes it possible to eliminate those paths (and thus 
also cycles) in TC(y U >-o) that interleave y and y$ more than once. In this way 
acyclicity reduces to the lack of 0-confiicts. 

It seems that the interval order (IO) requirement in Theorem^cannot be weak- 
ened without needing to strengthen the remaining assumptions. If neither of y and 

is an IO, then we can find such elements x\, y\, zi, w\, X2, y% z%, W2 that 

x\ yy\,zi ywi,x\ ^-wi,z\ ^y\,x 2 yoyi,Z2 yow 2 ,x 2 w 2 , 

and z 2 yi- If we choose y\ = x 2 , z\ = yi, w\ = z 2 , and x\ = w 2 , then we get a 
cycle in y U yo. Note that in this case y and are still 0-compatible. Also, there 
is no SPO preference relation which contains >- U because each such relation 
has to contain TC(y U yo). This situation is pictured in Figure^] 

Example 4 Consider again the preference relation y^ ■ 

(m,y) y^ (tn',y') = m = m! Ay > y'. 

Suppose that the new preference information is captured as yc 3 which is an IO 
but not a WO: 



(m,y) y Cj (m',y') =m = "VW"Av= 1999 Am' = "Kia"A/ = 1999. 
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Fig. 4 A cycle for 0-compatible relations that are not IOs. 

Then TC(^q U >-c 3 X which properly contains >~a U ^C 3 > is defined as the SPO 

(m,y) ^c 4 ( m ',y') = m=m'Ay> y'V 

m = "VW" Ay> 1999 A m' = "Kia" Ay' < 1999. 

TheoremQJimplies that if y and >-q are 0-compatible and one of them contains 
only one pair, then TC(y U >-o) is an SPO. So what will happen if we break up 
the preference relation from Figure @]into two one-element relations >-j and 
>-2 and attempt to apply Theorem^twice? Unfortunately, such a "strategy" does 
not work. The second step is not possible because the preference relation y 2 is 
not 0-compatible with the revision of >- with y \ . 

For dealing with prioritized composition, O-compatibility can be replaced by a 
less restrictive condition, 1 -compatibility, because prioritized composition already 
provides a way of resolving some conflicts. 

Theorem 2 For every preference relations y and y^ such that y$ is an 10, y is 
an SPO and y is l-compatible with yo, the preference relation TC(yo > y) is an 
SPO. 

Proof We assume that TC(yo r> >-) is not irreflexive and consider a cycle of min- 
imum length in >-q > >-. K the cycle has two non-consecutive edges labeled (not 
necessarily exclusively) by yo, then it can be shortened, because is an 10. The 
cycle has to consist of an edge t\ yoh and a sequence of edges (labeled only by 
y): t 2 y h,.. . ,f„_i y t n ,t n y t\ such that n > 2. and t\ /o 'n • ■ • h h- 
(We cannot shorten sequences of consecutive y -edges because y is not necessar- 
ily preserved in >-q t> y.) Thus (t\,t2) is a 1-conflict violating 1 -compatibility of 
y with >-o. □ 

Clearly, there is no SPO preference relation which contains > y, and is 
closer to >- than TC(yo t> >-). Violating any of the conditions of Theorem [5] may 
lead to a situation in which no SPO preference relation which contains > y 
exists. 

If yo is a WO, the requirement of 1 -compatibility and the computation of 
transitive closure are unnecessary. We first recall some basic properties of weak 
orders. 
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Proposition 3 Let y be a WO preference relation over a schema R and ~ the 
indifference relation generated by y. If x yy,y^z and z y w, then also x y z and 
yyw. 

Theorem 3 For every preference relations yo and y such that yo is a WO and 
y an SPO, the preference relation yo > >- is an SPO. 

Proof Clearly, >-'= >-q > as a subset of yo U >-> is irreflexive. To show tran- 
sitivity, consider t\ y' ti and ti y' tj. There are four possibilities: (1) If ?i >-q ti 
and ?2 yo tj, then t\ y$ ?3 and ?i >-' tj. (2) If >-o t% tj ^0 ?2 and ti y ?3, then 
also ?2 >~o f 3 or f 2 ^0 h (where ~o is the indifference relation generated by >~o). In 
either case, t\ yo t-$ and t\ y' t$ (the second case requires using Proposition [3j. (3) 
h ^0 h,t\ y ?2 and ti yo h'- symmetric to (2). (4) If ti ^0 t\,t\ y ti, t$ )fo h and 
ti y t-i, then f3 ^0 fl (by the negative transitivity of >-o) and t\ y t%. Thus fi y' t%. 

□ 

Let's turn now to Pareto composition. There does not seem to be any simple 
way to weaken the assumptions in Theorem^using the notion of 2-compatibility. 
Assuming that >-, yo, or even both are IOs does not sufficiently restrict the possi- 
ble interleavings of y and >-q in TC(yo <E>y) because neither of those two pref- 
erence relations is guaranteed to be preserved in TC(yo ® >-). However, we can 
establish a weaker version of Theorem[3] 

Theorem 4 For every preference relations yo and y such that both are WOs, the 
preference relation yo<E>y is an SPO. 

Proof Similar to the proof of Theorem[3] 

Proposition |21 implies that for all preference relations defined using ERO ipfs, 
the computation of the preference relations rC(VU^o), TC(yo > >-), as well 
as TC(y ® >-o) terminates. The computation of transitive closure is done in a 
completely database-independent way. 

Example 5 Consider Examples^and|4] We can infer that 

t\ = ("VW",2002) y c , ("Kia", 1997) = f 3 , 

because ("VW",2002) y Cl ("VW", 1999), ("VW", 1999) y Ci ("Kia", 1999), and 
("Kia", 1999) y Cl ("Kia", 1997). The tuples ("VW", 1999) and ("Kia", 1999) are 
not in the database. 

If the conditions of Theorems \l\ and [2] do not apply, Proposition |2| implies 
that for ERO ipfs the computation of TC(y U >- ), TC(y t> y) and TC(y ® >-o) 
yields some finite ipf C(t\,ti). Thus the irreflexivity of the resulting preference 
relation reduces to the unsatisfiability of C(t,t), which by Proposition Q] is a de- 
cidable problem for ERO ipfs. Of course, the relation, being a transitive closure, 
is already transitive. 
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Example 6 Consider Examples Q] and |2] Neither of the preference relations >-c, 
and yc 2 ls an interval order. Therefore, the results established earlier in this sec- 
tion do not apply. The preference relation >~c* = TC(yc Y ^ ^c 2 ) ls defined as fol- 
lows (this definition is obtained using Constraint Datalog computation): 

(m,y) >~c* (tn',y') = m = m' Ay > /V 

m = "VW" A mf ± "VW" t\y>y'. 

The preference relation >-c* is irreflexive (this can be effectively checked). It 
also properly contains >-c, U ^c 2 > because ?i >~c* h but ?i t^ and fi 73. The 
query (Oc*(Car) evaluated in the instance r\ (Figure QJ returns only the tuple t\. 



4.2 Weak orders 

Weak orders are practically important because they capture the situation where the 
domain can be decomposed into layers such that the layers are totally ordered and 
all the elements in one layer are mutually indifferent. This is the case, for exam- 
ple, if a preference relation can be represented using a numeric utility function. 
If a preference relation is a WO, a particularly efficien t (essentially single pass) 
algorithm for computing winnow is applicable (Cho04|). 

We will see that for weak orders the transitive closure computation is unnec- 
essary and minimal revisions are directly definable in terms of the preference re- 
lations involved. 

Theorem 5 For every 0-compatible WO preference relations y and yo, the pref- 
erence relations y U yo and y ® >-q are WO. 

Proof In view of LemmaQ] we can consider only y'= y U yo. 

Irreflexivity is obvious. For transitivity, assume t\ y' ti and ti y' t^. If t\ y 
h y h (resp. t\ y§ ti y§ t-$), then t\ y t$ (resp. t\ y$ t$) and t\ y' t^. If fi yo h 
and ti y ?3, we need O-compatibility to infer that ti ^ t\ and thus t\ y ti or t\ ~ ti 
(where ~ is the indifference relation generated by y). In both cases, we can infer 
t\ y ?3 and thus t\ y' ti,. The last case is symmetric to the previous one. 

For negative transitivity, consider t\ )/-' ti and ti )f' ?3. Then t\ ^0 h, h ^0 H, 
t\ )f t%, and t% )/- ?3. Consequently, t\ ^0 f3> h V~ f 3> an d tnus h h- O 

Note that without the O-compatibilit y assum ption, WOs are not closed with 
respect to union and Pareto composition (ChoQ3|). 

For prioritized composition, we can relax the O-compatibility assumption. This 
immediately follows from the fact that WOs are closed with respect to prioritized 
composition 

Proposition 4 For every WO preference relations y and y$, the preference rela- 
tion y$ > y is a WO. 

A basic notion in utility theory is that of representability of preference rela- 
tions using numeric utility functions: 
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Definition 8 A real-valued function u over a schema R represents a preference 
relation >- over R iff 

V*i,*2 [h yt 2 iSu(t 1 ) >u(t 2 )}. 

Such a preference relation is called utility-based. 

Being a WO is a necessary condition for the existence of a numeric represen- 
tation for a preference relation. However, it is not sufficient for uncountable orders 
<tFis70h . It is natural to ask whether the existence of numeric representations for 
the preference relations y and implies the existence of such a representation 
for the preference relation y'= {y$ 9 >~) where 9 6 {U, >, ®}. This is indeed the 
case. 

Theorem 6 Assume that y and y^ are WO preference relations such that 

1. y and yo are 0-compatible, 

2. y can be represented using a real-valued function u, 

3. yo can be represented using a real-valued function u$. 

Then y' = yo 9 y, where 9 6 {U, > , <8>}, is a WO preference relation that can be 
represented using any real-valued function u' such that for all x 

u'(x) = a ■ u(x) + b ■ uq(x) + c 

where a and b are arbitrary positive real numbers. 

Proof Assume x y' y. Thus x yo y or x y y. If x yo y, then uo(x) > uo(y). Also, in 
this case y y" x because of O-compatibility. This implies u(x) > u(y). Consequently, 
u'{x) > u'{y). The other case is symmetric. 

Assume u'(x) > u'iy). Thus uq(x) > uo(y) or u(x) > u(y). In the first case, we 
get x yo y, in the second, x y y. Consequently, xy' y. □ 

Surprisingly, the O-compatibility requirement cannot in general be replaced by 
1 -compatibility if we replace U by \> in Theorem [6] 

Example 7 Consider the Euclidean space M x S%, and the following orders: 

(x,y) y 1 (x',y')=x>x f , 
(x,y) y 2 (x',y')=y>y', 

The orders >-j and ^2 are 1 -compatible (but not 0-compatible) WOs. It is well 
known that their prioritized (also called lexicographic) composition is not repre- 
sentable using a utility function (Fis70). 

Thus, preservation of representability is possible only under O-comp atibility, 
in which case ^oU^ = ^ol>^ = ^o®^ (LemmaQ. (The results (I'is70) indicate 
that for countable domains considered in this paper, the prioritized composition of 
WOs, being a WO, is representable using a utility function. However, that utility 
function is not definable in terms of the utility functions representing the given 
orders.) 
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We conclude this section by showing a general scenario in which the union of 
orders occurs in a natural way. Assume that we have a numeric utility function u 
representing a (WO) preference relation >-. The indifference relation ~ generated 
by >- is defined as: 

x ~ y = u{x) = u(y). 

Suppose that the user discovers that ~ is too coarse and needs to be further refined. 
This may occur, for example, when x and y are tuples and the function u takes into 
account only some of their components. Another function uq may be defined to 
take into a ccount o ther components of x and y (such components are called hidden 
attributes (PFT03)). The revising preference relation >-o is now: 

xy y = u(x) = u(y) Auq(x) > u (y). 

It is easy to see that is an SPO 0-compatible with >- (but not necessarily a 
WO). Therefore, by Theorem[Othe preference relation y U >-o is an SPO. 

5 Incremental evaluation 

5.1 Query modification 

We show here how the already computed result of the original preference query 
can be reused to make the evaluation of the modified query more efficient. We will 
use the following result. 

Propositions (Cho03) If >-i and y 2 are preference relations over a relation 
schema R and >-j C y 2 , then for all instances r ofR: 

- C0y 2 (r) C COy^r); 

- (Oy 2 (co y] (r)) = 0)y 2 (r) if>~i and y 2 are SPOs. 

Consider the scenario in which we iteratively modify a given preference query 
by revising the preference relation using only union in such a way that the revised 
preference relation is an SPO (for example, if the assumptions of TheoremQJare 
satisfied). We obtain a sequence of preference relations >-i, . . ., y„ such that >-] C 

••• Qy n - 

In this scenario, the sequence of query results is: 

r =r,n = CO yi {r),r 2 = 0)y 2 (r), . . . ,r n = (Oy n (r). 
Proposition|5]implies that the sequence ro, r\, . . . , r„ is decreasing: 

ro 2 n 2 • • • 2 r„ 
and that it can be computed incrementally: 

r\ = CO yi {r ),r 2 = 0)y 2 {ri),...,r n = (Oy n {r n -\)- 

To compute r,, there is no need to look at the tuples in r — r,--i, nor to recompute 
winnow from scratch. The sets of tuples r\ , . . . , r n are likely to have much smaller 
cardinality than ro = r. 
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It is easy to see that the above comments apply to all cases where the revised 
preference relation is a superset of the original preference relation. Unfortunately, 
this is not the case for revisions that use prioritized or Pareto composition. How- 
ever, given a specific pair of preference relations y and y^, one can still effec- 
tively check whether TC(yo > y) or TC(yo ® y) contains y if the validity of 
preference formulas is decidable, as is the case for ERO formulas (Proposition 

5.2 Database update 

In the previous section we studied query modification: the query is modified, while 
the database remains unchanged. Here we reverse the situation: the query remains 
the same and the database is updated. 

We consider first updates that are insertions of sets of tuples. For a database 
relation r, we denote by A + r the set of inserted tuples. We show how the previous 
result of a given preference query can be reused to make the evaluation of the 
same query in an updated database more efficient. 

We first establish the following result. 

Theorem 7 For every preference relation y over R which is an SPO and every 
instance r ofR: 

(O y (rUA + r) = (Oy((O y {r)UA + r). 

Proof Assume t (Dy((Dy(r)UA + r) . Then either t G" (Oy (r) U A + r or there exists 
t' G (O y (r)UA + r such thatf' y t. In the first case, t g (Oy (r) mdt G" A + r.lft 
and t g A + r, then t G' (O y (rUA + r). If there exists t' G (0 y (r) such that t' y t, then 
also t G" (Oy(r L)A + r). In the second case, t' G rUA + r and thus t G" (Dy(r LlA + r). 

Assume t G" (Oy(rUA + r). Then either t G" rUA + r or there exists t' G r UA + r 
such that t' y t. In the first case, t (0 y ((0 y (r) UA + r). In the second case, if t' 6 
A + r, thenf (O y ((Oy(r)UA + r). So consider t' G r-A + r. If t e r butf ^A + r, then 
t $(0 y (r) U A + r and t ^(0 y ((0 y (r) U A + r). If t e A + r, then there exists t" G co y (r) 
such that t" y t. (t" may be t' or some element dominating t'.) Therefore, in this 
case also t G" (Oy((O y (r) UA + r). □ 

Consider now the scenario in which we have a preference relation y, which is 
an SPO, and a sequence of relations 

r = r,n =r Q UA + r ,r 2 = r x UA + r u ...,r n =r„_i \JA + r n _ x . 

Theorem0shows that 

(O y {n) = (O y {(Oy{ro)UA+r ) 
(0 y (r 2 ) = (Oy{(0y{r l )UA + r l ) 

(Oy{r n ) = (Oy{(Oy{r n ^)\J A+r n - X ). 

Therefore, each subsequent evaluation of winnow can reuse the result of the 
previous one. This is advantageous because winnow returns a subset of the given 
relation and this subset is often much smaller than the relation itself. 
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Clearly, the algebraic law, stated in Theorem^ can be used together with other, 
well-known law s of relational algebra and the laws specific to preference queries 
dCho03t IKH03I) to produce a variety of rewritings of a given preference query. To 
see how a more complex preference query can be handled, let's consider the query 
consisting of winnow and selection, COy(a a (R)). We have 

(O y (a a (rUA + r)) = co y {a a (r)Ua a (A + r)) = 0)y(cOy(a a (r))Lla a {A + r)) 

for every instance r of R. Here again, one can use the previous result of the query, 
(O y (a a (r)), to make its current evaluation more efficient. Other operators that 
distribute through union, for example projection and join, can be handled in the 
same way. 

Next, we consider updates that are deletions of sets of tuples. For a database 
relation r, we denote by A~r the set of deleted tuples. 

Theorem 8 For every preference relation y over R and every instance r ofR: 

(Oy(r) -A~r C (Dy(r — A~r). 

Theorem[8]gives an incremental way to compute an approximation of winnow 
from below. It seems that in the case of deletion there cannot be an exact law 
along the lines of Theorem^! This is because the deletion of some tuples from the 
original database may promote some originally dominated (and discarded) tuples 
into the result of winnow over the updated database. 

Example 8 Consider the following preference relation >-= {(a,bi),...,(a,b n )} 
and the database r = {a, b\ , . . . ,b n }. Then (0 y (r) = {a} but 

G)y(r-{a}) = {bi,. ..,b n }. 

6 Finite restrictions of preference relations 

6.1 Restriction 

It is nat ural to consider restrictions of preference relations to given database in- 
stances <TC02h . 

Definition 9 Let r be an instance of a relation schema R and y a preference rela- 
tion over R. The restriction [y] r of y to r is a preference relation over R, defined 
as 

[y] r = ynrxr. 

We write (x,y) 6 [y] r instead of x[>-] r 3> for greater readability. 

The advantage of using [y] r instead of y comes from the fact that the for- 
mer depends on the database contents and can have stronger properties than the 
latter. For example, [y] r may be an SPO, while y is not. Similarly, [y] r may be 
/-compatible with [>~o]r> while y is not /-compatible with y^. Therefore, restric- 
tions could be used instead of preference relations in the revision process. 

The following is a basic property of restriction. It says that the restriction to 
an instance does not affect the result of winnow over the same instance, so the 
restriction can be used in place of the original preference relation. 
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Theorem 9 Let r be an instance of a relation schema R and y a preference rela- 
tion over R. Then 

Proof We have [>-] r C r and thus (Oy(r) C ffl[ > _] r (r). In the other direction, assume 
t <j? (Dy(r). If t r, t 0)[^] r (r). If f G r and there exists t' G r such that f' >- f, then 
also (f',f) G [>-] r andr G'ffl^^r). □ 

We also establish that restriction distributes over the preference composition 
operators. 

Theorem 10 If r is an instance of a relation schema R, 9 G {U, >, ®], and y and 

>-0 are preference relations over R, then 

Proof We prove this result for 6 = >. The other cases are similar. 
We have the following equivalences: 

(x,y) g [y }r>[y] r = 

(x,y) G [y ] r V (y,x) h] r A (x,y) G [>-] r = 

x yoy G r Ay G rV (y xV x & rV y £ r) Ax y y Ax € r Ay € r = 
^ ^0 J A* £ r Ay £ r\/y )/-qxAx y y Ax G rAy G r = 
(jc >-o.y Vy ^o*A* >-y) A* G r Ay G r = 
(x,y) G [^o c> y] r . 

The preference revision studied earlier in this paper typically involved the 
computation of the of the revised preference relation defined as the transitive clo- 
sure TC(yo 6 y), where 6 G {U, >, <g>}, y is the original preference relation, and 

is the revising preference relation. We study several different ways of impos- 
ing the restriction of preferences to a relation instance. We consider the following 
preference relations: 

yi = TC(y Q 9y), 
>~2 = [TC(y Q e y)] r , 
y, = TC([y 9y] r ), 
y 4 = TC([y } r d[y} r ). 

We establish now some fundamental relationships between the preference re- 
lations >-i,>-2, >-3, and >-4. 

Theorem 11 Let 8 G {U,>,®}, and y and yo be preference relations over a 
schema R. Then for every instance r ofR: 

^4 = hChC y\, 

and there are relation instances for which the containments are strict. 
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Proof The equality of ^4 and ^3 follows from Theorem 1 101 For ^3 C y%, we 
have that 

hflHr Q^o9y, 

and 

[>- 9 y] r C r x r. 

Thus 

h=rC(h9Hr)Crxr, 

and 

rc(>-o >-) n r x r =^ 2 • 

The containment ^C^i follows from the definition of the restriction. 

An example where y^ C >~2 C >-i is as follows. Let >-= {(a,£>)}, ^0= 
{(b,c)}, r = {a,c}. Then [>-} r = [y ] r = 0- Thus also [y 9 y] r = [y ] r 9 [y] r = 
0, and ^3= 0. On the other hand, >-i= {(a,b), (b,c), (a,c)} and ^2= {( a ) c )}- 

□ 

Corollary 1 Let 9 G {U,>,®}, and y and y® be preference relations over a 
schema R. Then for every instance r of a R: 

0) yi {r) = 0)y 2 {r) C m y3 (r) = 0)y A {r), 

and for some cases the containment is strict. 

Proof Follows from Theorem [9] and Theorem In the example given in the 
proof of Theorem ll II we obtain (0y 2 (r) = {a} and (0y 3 (r) = {a, c}. □ 

We study now the order-theoretic properties of restriction. 

Theorem 12 Let 9 G {U, >,<8>}, and y and y$ be preference relations over a 
schema R. Then for every instance r ofR, y 1 is an SPO implies that y^. is an SPO, 
which implies that y$ is an SPO. There are cases in which the reverse implication 
does not hold. 

Proof Because ^2 Q >~u ^2 ls irrefiexive. Assume that x y% y and y >-2 Z- Then 
x y 1 y, y y 1 Z, x G r, y G r, and z G r. Therefore, x y\ z /\x E r Az E r, and x y-i z. 

The preference relation y \ = { (a, a) } is not an SPO (and can be obtained from 
some preference relations >-o and y using any composition operator). However, 
its restriction ^2= [^i]r f° r r = {b} is empty, and thus an SPO. 

Assume now ^0= {(o,b)} and >-= {(b,a)}. Consider 9 = U and r = {b}. 
Thus, y\= {(a,b),(b,a),(a,a),(b,b)} and >-2= {(b,b)} (so it is not an SPO). 
On the other hand, [>-o U y] r = and ^3= 0, too. Similar examples can be con- 
structed for the other composition operators. □ 

Unfortunately, for weak orders there is no property analogous to Theorem ll 21 
Subsequently, we examine the impact of restriction on compatibility. 

Theorem 13 Let y and yo be preference relations over a schema R. Then for 
every instance r of a relation schema R and every i = 0,1,2 if y is i-compatible 
with yo, then [y] r is i-compatible with [Vo]r- There are cases in which the reverse 
implications do not hold. 
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Proof For O-compatibility the situation is clear. If there are no 0-confiicts between 
>- and >-o, then there are no 0-conflicts between [>-] r and [>-o];-- However, for 
higher-level conflicts, the situation is more complicated. 

Assume now that y is 1 -compatible with >-o and consider a 1 -conflict between 
[>-] r and [Vo]f Then there are elements t\ , f 2 , 5 1 , • ■ • , s k of r such that 

{h,h) G N]r,(^,Jl) G [>-] r ,...,(s k ,ti) G [Hr, 

and 

Ol,*jfc) £ [>-o]r,---,(5l,f2) £ [>-0]r. 

Consider now any two elements jc and y among t\ , f 2 , 5 1 , • ■ ■ , s k sucn that (x,y) G 
[>-] r (resp.(x,y) G [>-o],-)- Clearly then also x y y (resp., x y$ y)- Assume (x,y) G 
[>-],. and (y,x) g" [>~o]r- Thus y ^0 So we obtain a 1-confiict between the prefer- 
ence relations y and >-o. 2-conflicts are analyzed in the same fashion. 

To see that the lack of 1 -conflicts between [y] r and [>~o]r does not imply the 
lack of 1 -conflicts between >- and yo, consider 

^0= {(c,a)} 

y= {(a,b),{b,c),(a,c)}, 

and r = {(a,c)}. Then [y} r = {(a,c)} and [>~o]r = {c,a}. There are no 1-conflicts 
between [y] r and [>-o]r but there is a 1-conflict between >- and yo. Analogous 
examples can be constructed for other kinds of conflicts. □ 

Finally, we compare the computational properties of >-i, >-2 and ^3. The pref- 
erence relation y 1 is recomputed only after preference revisions. The relation ^2 
is recomputed after every revision and every database update. The recomputation 
after an update uses >- 1 as a selection condition applied torxr (where r is the 
current relation instance). The relation ^3 is also recomputed after every revision 
and every database update. However, in the latter case the computation is more in- 
volved than for y%, because transitive closure of a finite binary relation needs to be 
computed. Overall, >- 1 represents the most stable and comprehensive preference 
information. Even if y% is stored, >-j needs to be kept up-to-date after preference 
revisions, since it is used in the recomputation of >-2 after an update. The prefer- 
ence relation ^3 can be stored, revised, and updated without any reference to y\. 
However, in this case some preference information is lost, c.f., Corollary^ 



6.2 Non-intrinsic preferences 

Non-intrinsic preference relations are defined using formulas that refer not only to 
built-in predicates. 

Example 9 The following preference relation is not intrinsic: 

xyp re fy = (x,y) ePref 

where Pref is a database relation. One can think of such a relation as representing 
stored preferences. 
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Revising non-intrinsic preference relations looks problematic. First, it is typ- 
ically not possible to establish the simplest order-theoretic properties of such re- 
lations. For instance, in Example[9]it is not possible to determine the irrefiexivity 
or transitivity of yp re f on the basis of its definition. Whether such properties are 
satisfied depends on the contents of the database relation Pref. Second, the tran- 
sitive closure of a non-intrinsic preference relation may fail to be expressed as a 
finite formula. Again, Example|9]can be used to illustrate this point. 

However, it seems that restriction may be able to alleviate the above problems. 
Suppose y is the original and the revising preference relations. Computing 
TC(}~q U y) may be infeasible, as indicated above. But computing TC([yo U y] r ) 
is not difficult, as [>-o U >-] r is computed by the first-order query 

(x y Q y\<xyy) Ax eRAye R. 

For other composition operators, the same approach also works because they are, 
like union, defined in a first-order way. 



7 Weak-order extensions 

Theorems [3] and [5] and Proposition |4] demonstrate that for weak orders one can 
prove stronger properties about revisions than for general partial orders. The 0- 
compatibility or the interval order requirements may be relaxed, and the transitive 
closure computation may no longer be necessary. 

So it would be advantageous to work with weak orders. Such orders can, for 
example, be obtained as extensions of the given SPOs. We show here h ow to ex- 
press the constructio n of we ak order extensions using Datalog-i rules ( A H V95l) 
and the Rule Algebra (IN88). Although not much can be shown in that framework 
about WO extensions of arbitrary SPOs, the construction of WO extensions of 
interval orders (IOs) can be guaranteed to terminate. 



7.1 Rules 

We define the application r(X) of a rule r to an input set of facts X in the standard 
way. 

Definition 10 Assume r is of the form 

A «— Bi,.. . ,S„,-iCi, . . .,~<C m . 

Then r(X) consists of all the facts t(A) such that t(B ( ) e X, i = l,...,n, and 
t(C/) $X, j = l,...,m, where T is a ground substitution. In an inflationary appli- 
cation r(X) is added to X. 

In this paper, we are dealing with infinite sets of facts represented by con- 
straints. However, the above definition of rule application still applies. From this 
definition, we can obtain a more operational definition that will tell us how to 
construc t the constraints in the head of the rule r from the constraints in the body 
(iKLPOOl) . 



22 



Assume that each goal Bj, i = 1 , . . . , n is described by a constraint /3, and each 
goal Cj, j = 1, . . . ,m by a constraint /y. Also denote by V the set of variables that 
occur only in the body of r. Then A is described by the formula 

3Kj3iA-"A/3„A-yiA"-A-v)fc. 

fro m whic h negation and quantifiers have been eliminated. 

(IN88) present a language called Rule Algebra (RA) which allows rule com- 
position. The syntax of RA expressions is defined as follows: 

Expr :: = r\ Expr ; Expr \ Expr U Expr \ Expr + , 

where r is a single rule. The symbol ";" denotes sequential and "U", parallel com- 
position. The superscript "+" denotes unbounded iteration. 

The application of RA expressions is defined as follows (IN88): 

- for a single rule it is defined as in Definition ! 101 

- {Fi;F 2 ){X) = F 2 {F X {X)\ 

- {F 1 UF 2 )(X)=F l {X)UF 2 (X), 

- F+(X) = {J i>0 F i (X). 

Like rule application, the application of RA expressions comes in two different 
variants: inflationary and non-inflationary. 

Rule Algebra can be implemented directly. However, (IN88) show also how 
to ma p Rule Algebra expressions to a class of locally-stratified logic programs 
(Prz88). This class requires a limited use of function symbols to implement coun- 
ters. 



7.2 Strict partial orders 

llFis85h presents a construction of a WO extension of a finite SPO. It is based on a 
very simple intuition. 

Assume we are given that x y y and y ~ z, or x ~ y and y y z- In a weak 
order one needs to be able to have also x y z in both cases (see Proposition |3j. 
Therefore, one could produce a WO extension y' of a given SPO y by supporting 
the derivation of the implied order relationships. Clearly, such derivation should 
avoid contradiction (x y' y and yy'x). 

Example 10 Consider the following order >-= {(a,c), (b,d)}. Thus a ~ d and 
b ~ c. So w could derive ay' b and by'a,a contradiction. 

We construct an extension y' of a given SPO y using a set of rules. Unfortu- 
nately, for infinite orders the construction does not always produce a weak order. 
The input preference relation y is described using a set of facts of the relation T 
of arity 2n where n is the arity of the database relation over which >- is defined. 
The output preference relation y' is also described as a set of facts of the relation 
T but those facts are computed using rule application. 

First, we have two rules Pn and P\ 2 for deriving new order relationships: 



Pu : T(x,z)^T(x,y)A-,T(z,y)A^T(y,z). 
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Pn ■ T(x,z) «- T(y,z)A^T(x,y) A ^T(y,x). 
Second, we have the conflict removal rule P2 : 

P 2 : T{x 7 y)^T{x,y)A^T{y,x). 

We note that the rules P 1 , P12 , P2 need to be applied in a specific order. We use 
the following Rule Algebra expression E\ (IN88; AHV95) 

Ei = ((Pn U Pn) ■ P 2 )+, 

applied to the input preference relation. In the rule P2 and the expression E\, the 
desired semantics is non-inflationary because we want to eliminate conflicts. 

Example 11 Consider the preference relation >-= {(a,c),(b,d)} from Example 
1101 Applying the rules Pi\ and Pi 2 we obtain the relation 

T(x,y) = x = a Ay 7^ a V x = b Ay 7^ b\/ x 7^ c /\y = c V x 7^ d Ay = d. 

This is not an SPO because, for example, we have T(a,b) and T(b,a). Applying 
the rule Pi, the conflict is removed, yielding 

T(x,y) = x = a Ay 7^ a Ay ^ b\/ x = b Ay ^ b Ay 7^ a 
\lx^cf\x^dAy = c\lx^cAx^df\y = d. 

which is a weak order. Thus, no further iterations are necessary. 

Denote by T\ the preference relation obtained at the end of the i-th stage in the 
computation of E\ . Clearly, if 7] is a weak order, then nothing new is produced 
at the next stage, i.e., 7/ + i = 7). However, the reverse implication does not have 
to hold for arbitrary SPOs. Therefore, in each stage i, ZJ needs to be separately 
checked for the weak order property (Proposition \l\ implies that the appropriate 
properties are decidable under the assumption that the input preference relation is 
described by an ERO preference formula). 

Example 12 Consider the following rational-order preference relation y adapted 
from (lFis85l) : 

xyy = x>y Ax j^O Ay ^0. 
The corresponding indifference relation ~ is defined as 

x~y = x = y\/x = 0\/y = 0. 

The relation >- is not a weak order but even the first iteration of the above rules 
fails to produce anything new. Consider any rational number b 7^ 0. There are 
numbers a and c such that a > b, b > c, a ~ and c ~ 0. So on the one hand 
we have initially T(b,c), -iT(c,0) and -iT(0,c), and applying the rule Pn we get 
T(b,0). But on the other hand we have T(a,b), ~^T(a,0) and ->r(0,a). Applying 
the rule Pn we get T(0,b). Therefore, the rule P2 does not derive T(b,0), T(0,b), 
or any other new fact. 
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It is an open question what kind of properties a preference relati on sho uld sat- 
isfy so that the condition 7J+i = T, implies the weak order property. (Fis85) shows 
that such an implication holds for SPOs over finite domains. Therefore, it also 
holds for finite restrictions of arbitrary SPOs (Section For a finite restriction 
[y] r a different way for constructing a weak order extension of [y] r is available 
through the use of ranking (Cho03). The "best" tuples - those in (Oy(r) - receive 
rank 1, the "second-best" rank 2 etc. Then the weak order extension y' is defined 
as 

x y' y = rank(x) < rank(y). 

7.3 Interval orders 

For interval orders, we can show stronger results about constructing WO exten- 
sions. We still use the Datalog-i/Rule Algebra framework but instead of the ex- 
pression E\ we use the following expression E 2 : 

E2 = {Pu ; ^i2) + - 

We will see that for E 2 the inflationary and non-inflationary semantics coin- 
cide. 

For simplicity, we identify here a preference relation with the set of facts of 
the T predicate describing it. 

Example 13 Consider Example 1 121 Applying the rule P\\ to the preference rela- 
tion y from this example (which is an interval order) yields the following prefer- 
ence relation >-': 

x y' y = x > y Ax ^ A y ^ Vx ^ Ay = 0. 
This relation is a total order, and thus also a weak order. 

Lemma 2 For every irreflexive preference relation X, X C Pu(X), X C Py^iX), 
and X C P l2 (P n (X)). 

Lemma 3 Assume X is an interval order preference relation. Then Pi\{X) and 
Pn(X) are interval order preference relations. 

Proof WLOG, consider Y = Pu(X). Clearly, Y is irreflexive. For transitivity, con- 
sider T(x,y) E Y and T(y,z) & Y. Then there is a z' such that T(x,z') EX, T(z\y) g 
X, and T(y,z') <£X. Similarly, there is az" such that T(y,z") EX, T(z",z) <£X, and 
T{z,z") qLX. Because X is an interval order, we have T(x,z") EX or T(y,z') E X. 
Assume the former. Then T(x,z) E Y. The preservation of the interval order con- 
dition can be shown in a similar way. □ 

Lemma 4 Let F = (P n ;P l2 ) and Y bean SPO. Then F (J) C Y iffY is a WO. 

Proof If Y is a WO, then 

Y=P n (Y)=P l2 (Pn(Y)). 

If Y is not a WO but an SPO, then there are x, y and z such that T(x,y) E Y, 
T{x,z)E'Y, T(z,x)#Y, T(y,z) 07 and T(z,y) Y. Thus T(x,z) E P n {Y) and by 
LemmaE T(x,z) E P u (Pn{Y)). Thus Fu^ii^)) D 
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The following theorem shows that finite termination of the evaluation of E% is 
equivalent to the weak order property. 

Theorem 14 Let X be an 10. For every i > 0, E 2 (X) = (Pii;Pn) + (X) equals 
(P n ;Pny(X) iff{P\ X ;Puj{X) is a WO. ' 

Proof Follows from Lemmas|2]|3] and|4| Note that for j < i, (Pn;Pu) j { x ) Q 
(Pu;P\2) l (X). It is essential that the given preference relation be an 10. Otherwise, 
an application of P\i',Pn may produce preference relations which are not SPOs 
and the equivalence in Lemma|4]may stop to hold. □ 

To explore the possible implementations of the Rule Algebra expression E2, 
we note first that Lernma[2]implies that for the rules P\\ and Pu inflationary and 
non-inflationary semantics coincide. Therefore, we can use i nflationary or non- 
inflationary languages for the implementation of E2 . ( AHV95D i ndicate that Rule 
Algebra expressions can be translated to Inflationary Datalog^ (GS86), a variant 
of Datalog that allows unstratified negation (necessary here because of the rules 
P\i and P12) at the price of having inflationary semantics. It is clear that Inflation- 
ary Datalog-i programs terminate on finite inputs. However, preference relations 
are typically infinite. Still, they are finitely representable using preference for- 
mulas, and thus we are dealing with the problem of termination of Inflationary 
Constraint Datalog^ programs. Fortunately, there are positive results established 
in this area in (KKR95), which, together with Theorem^] imply the following: 

Theorem 15 Every interval order preference relation y, defined using an ERO 
formula, has a weak order extension y', defined using an ERO formula. The for- 
mula defining y' can be computed in exponential time. 



8 Related work 

8.1 Preference change 

llHan9 5 ) presents a general framework for modeling change in preferences. Prefer- 
ences are represented syntactically using sets of ground preference formulas, and 
their semantics is captured using sets of preference relations. Thanks to the syntac- 
tic representation preference revision is treated similarly, though not identically, 
to belief revision (GR95), and some axiomatic properties of preference revisions 
are identified. The result of a revision is supposed to be minimally different from 
the original preference relation (using a notion of minimality based on symmetric 
difference) and sati sfy som e additional background postulates, for example spe- 
cific order axioms. (Han95) does not address the issue of constructing or defining 
revised relations, nor does it s tudy the properties of specific classes of preference 
relations. On the other hand, (Han95) discusses also preference contraction, and 
domain expansion and shrinking. 

In our opinion, there are several fundamental differences between belief and 
preference revision. In belief revision, propositional theories are revised with prepo- 
sitional formulas, yielding new theories. In preference revision, binary preference 
relations are revised with other preference relations, yielding new preference re- 
lations. Preference relations are single, finitely representable (though possibly in- 
finite) first-order structures, satisfying order axioms. Belief revision focuses on 
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axiomatic properties of belief revision operators and various notions of revision 
minimality. Preference revision focuses on axiomatic, order-theoretic properties 
of revised preference relations and the definability of such relations (though still 
taking revision minimality into account). 

( Wil97) considers revising a ranking (a WO) of a finite set of tuples with new 
informatio n, and shows that a new ranking, satisfyi ng the A GM belief revision 
postulates (GR95), can be computed in a simple way. (Rev97) describes a number 
of different revision operators for constraint databases. However, the emphasis 
is on the a xiomatic properties of the operators, not on the definability of revised 
databases, formulates vario us scena rios of preference revision and does 

not contain any formal framework. (Won94) studies r evision and contraction of 
finite WO preference relations by single pairs t\ >-o f2- (Fre04) describes minimal 
change revision of rational preference relations between prepositional formulas. 



8.2 Preference queries 

Two different approaches to preference queries have been pursued in the literature: 
qualitative and quantitative. In the qualitative approach, preferences are specifie d 
using binary preference relations (lLL87t iGJMOOt ICho02t ICho03t lKie02t IKK 02). 
In the quantitative util ity-based appro ach, preferences are represented using nu- 
meric utility functions iAWOft IhP04'I. as shown in Section|4] The qualitative ap- 
proach is strictly more general than the quantitative one, since one can define 
preference relations in terms of utility functions. Howe ver, onl y WO preference 
relations can be represented by numeric utility functions llFis70l) . Preferences that 
are not WOs are common in database applications, c.f., Example^ 

Example 14 There is no utility function that captures the preference relation de- 
scribed in ExampleQ] Since there is no preference defined between t\ and tj or ?2 
and f3, the score of ti, should be equal to the scores of both t\ and ?2- But this im- 
plies that the scores of t\ and t2 are equal which is not possible since t\ is preferred 
over ?2- 

This la ck of e xpressivene ss of the quantitative approach is well known in utility 
theory (Fis70). The paper (Cho03) contains an extensive discussion of the prefer- 
ence query literature. 

In the earlier work on preference queries ( C ho03l : lKie02 'K one can find positive 
and negative results about closure of different classes of orders, including SPOs 
and WOs, under various composition operators. The results in the present paper 
are, however, new. Restricting the relations y and >-o (for example, assuming the 
interval order property and compatibility) and applying transitive closure where 
necessary make it possible to co me up w ith positive counterparts of the negative 
results in JCho03h . For example, |ChoQ3|) shows that SPOs and WOs are in general 
not clo sed w.r.t. union, which should be contrasted with Theorems \l\ and [5] In 
(lKie02h . Pareto and prioritized composition are defined somewhat differently from 
the present paper. The operators combine two preference relations, each defined 
over some database relation. The resulting preference relation is defined over the 
Cartesian product of the database relations. So such operators are not useful in the 
context of revision of preference relations. On the other hand, the careful design 
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of the language guarantees that every preference relation that can be defined is an 
SPO. 

Probably the most thoroughly studied class of qualitative preference queries 
is the class of skyline queries. A skyline query partitions all the attributes of a 
relation into DIFF, MAX, and MIN attributes. Only tuples with identical values 
of all DIFF attributes are comparable; among those, MAX attribute values are 
maximized and MIN values are minimized. The query in Example ^ is a ver y 
simple skyline query ( BKS01 ), with Make as a DIFF and Year as a MAX attribute. 
Without DIFF attributes, a skyline is a special case of ra-ary Pareto composition. 

Various algor ithms for evaluating qualitative prefere nce queries are described 
in dCho03tlTC0l . and for evaluating skyline queries, in (BKSriTt |PTFS03t lBGZ04) 
(BG04) describes how to implement preference queries that use Pareto compo- 
sitions of utility-based preference relations. In Preference SQL (KK02) general 
preference queries are implemented by a translation to SQL. (HP04) describes 
how materialized results of utility-based preference queries can be used to answer 
other queries of the same kind. 



8.3 CP-nets 

CP-nets (BBD + 04) are an influential recent formalism for reasoning with condi- 
tional preference statements under ceteris paribus semantics (such semantics is 
also adopted in other work (MD04; WD91)). We co njecture that CP-nets can be 
expressed in the framework of preference relations of (Cho03), used in the present 
paper, by making the semantics explicit. If the conjecture is true, the results of the 
present paper will be relevant to revision of CP-nets. 

Example 15 The CP-net M = {a y a, a : b y b,a :b y b} where a and b are Boo- 
lean variables, captures the following preferences: (1) prefer a to a, all else being 
equal; (2) if a, prefer b to b; (3) if a, prefer b to b. We construct a preference 
relation yc M between worlds, i.e., Boolean valuations of a and b: 

(a,b) y Cu {a',b r ) =a = l Aa' =0Ab = b' 

V a = lAa' = lAb = lAb' = 
Va = 0Aa'=0Afe = 0Afc' = l. 

Finally, the semantics of the CP-net is fully captured as the transitive closure 
TC(yc M )- Such clo sure can be computed using Constraint Datalog with Boolean 
constraints (KLPOO). 

CP-nets and related formalisms cannot express preference relations over infinite 
domains which are essential in database applications. 



9 Conclusions and future work 

We have presented a formal foundation for an iterative and incremental approach 
to constructing ans evaluating preference queries. Our main focus is on query 
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modification, a query transformation approach which works by revising the pref- 
erence relation in the query. We have provided a detailed analysis of the cases 
where the order-theoretic properties of the preference relation are preserved by 
the revision. We have considered a number of different revision operators: union, 
prioritized and Pareto composition. We have also formulated algebraic laws that 
enable incremental evaluation of preference queries. Finally, we have studied the 
strengthening of the properties of preference relations through finite restriction 
and weak-order extension. 

Tables^and|2|summarize the closure properties of preference revision under 
union and prioritized composition. There is no separate table for Pareto composi- 
tion, because there are only few results specific to this kind of composition. 





y spo 


yio 


y WO 


>-o SPO 


not closed 


TC SPO if 0-compat. 


SPO if 0-compat. 


>-oIO 


TC SPO if 0-compat. 


TC SPO if 0-compat. 


SPO if 0-compat. 


>-o WO 


SPO if 0-compat. 


SPO if 0-compat. 


WO if 0-compat. 



Table 1 Revision using union 





y spo 


yio 


y wo 


y spo 


not closed 


TC SPO if 0-compat. 


SPO if 0-compat. 


y 10 


TC SPO if 1-compat. 


TC SPO if 1-compat. 


TC SPO if 1-compat. 


H)WO 


SPO 


SPO 


WO 



Table 2 Revision using prioritized composition 



Future work includes the integration of our results with standard query opti- 
mization techniques, both rewritin g- and c ost-based. Semantic query optimization 
techniques for preference queries (Cho04) can also be applied in this context. An- 
other possible direction could lead to the design of a revision l anguage in which 
richer classes of preference revisions can be specified (GMR93)- 

One should also consider possible courses of action if the original preference 
relation y and lack the property of compatibility, for example if >- and are 
not 0-compatible in the case of revision by union. Then the target of the revision 
is an SPO which is the closest to the preference relation y U yo. Such an SPO 
will not be unique. Moreover, it is not clear how to obtain ipfs defining the revi- 
sions. Similarly, one could study contraction of preference relations. The need for 
contraction arises, for example, when a user realizes that the result of a preference 
query does not contain some expected tuples. 

Finally, one can consider preference query transformations which go beyond 
preference revision, as well as more general classes of preference queries that 
involve, for example, ranking (Cho03). 
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